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ABSTRACT 


A machine diagnosis scheme, generally makes use of a knowledge based technique to 
identify the reasons for departure of the machine from its normal behavior The present 
study aims to explore the possibility of employing Artificial Neural Networks (ANN) for 
condition monitoring apphcations m rotating machinery ANN offer a great deal of 
promise m such a monitoring strategy, especially m case of systems like Power Plants, 
where the number of components/sy stems are too many and complex to be mathematically 
modeled appropriately In the present study a user-interactive software for neural network 
simulator has been developed for prediction of faults m rotating machinery employing the 
back-propagation algorithm. Experiments were conducted on an existing laboratory rotor- 
ng Various types of faults were mtroduced in the set-up and processed signals were 
employed to tram the neural network The adaptability of various neural network 
architectures has been investigated Neural Network training was earned out with the 
chosen architecture till desired degree of convergence is achieved The network is finally 
tested and vahdated for test data with unknown faults An overall success rate between 
75-85 % is observed 
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1. INTRODUCTION 


Supervision of machines health is a subject of increased importance m the course of 
progressive automation Machinery condition monitoring is essential in order to anticipate 
problems m time so as to prevent complete failure It serves the important task of ensuring 
operation withm design constraints Vibration analysis has proved to be a powerful tool 
for analysis of failure in rotatmg machinery Vibration analysis with appropriate 
instrumentation can provide reliable knowledge of machine condition and allows relatively 
inexpensive repair on a pre-scheduled controlled basis in comparison to catastrophic 
failure whose repair may cost much more Condition monitoring based on vibration 
signature analysis is a custom built system for effective product quality assurance or 
machinery’ fault diagnosis 

Advances m data acquisition techniques and the dev elopments in the areas of knowledge 
acquisition, processing and programming offer potential for the application of automated 
strategies for machinery process supervision which would permit early detection of faults 
than is possible through conventional limi ts and trend checks Meaningful and more 
efficient strategies for condition monitoring of rotatmg machinery are a pressing need, due 
to increasing demands on reliability and safety for power generating systems 

There are three phases in the evolution of on-bne computer-based vibration monitoring of 
rotatmg machinery The first phase includes collecting the vibration data and other 
operatmg parameters that are relevant Phase two consists of identifying the complicated 
but normal behavior of the system, accounting for the variation of operatmg parameters 
In phase three, use is made of knowledge based systems (expert systems) to identify' the 
reasons for departures from normal behavior 

Artificial Neural Networks (ANN) find their role in the third phase, of a monitoring 
strategy' ANNs are also commonly referred to as Neural Network connections, Adaptive 
Networks, Neuron Computers and parallel distribution processors ANNs are massively 
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parallel interconnected networks of single processing elements intended to interact with 
the real-world in the same way as biological nervous system Neural Networks ha\e been 
used to solve a wide variety of science and engineering problems that involve extracting 
useful information from complex or uncertain data In Power plant engineering, especially 
the fault diagnosis of various processes like nuclear power plant, chemical process control, 
monitoring of turbomachinery and design automation of engineering structures are the 
fields m which adoption of Neural Network performance is quiet promising 

In the present study, a user-interactive software for a neural network simulator was 
developed for prediction of faults m rotating machinery The network employs back- 
propagation algorithm. It is written in ANSI-C and uses non-linear model of a neuron The 
simulator has an in-built input vector normalization Experiments were conducted on an 
existing laboratory- rotor-ng Five different primary faults and their combinations were 
introduced in the experimental set-up The vibration signals collected through 
piezoelectric transducers from the bearing blocks were employed to train the network The 
adaptability of various neural network architectures was also investigated Neural network 
training was earned out with the chosen architecture till a desired degree of convergence 
is achieved The network w'as finally tested and validated for test data with unknown 
faults An overall success rate between 75-85 % was observed 

A bnef review' of the commonly occurring faults in rotating machinery, currently available 
strategies for fault diagnosis and expert systems for fault diagnosis are discussed m 
Chapter 2 The basic principles of artificial neural netw'orks principles and the back- 
propagation algonthm for fault diagnosis are described in Chapter 3 Chapter 4 describes 
the experimental work, signal processing, training and testing of the network and its 
validation 
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2. DIAGNOSIS OF ROTATING MACHINERY 


The complexity of the dynamics of rotating machinery depends on the size as well as the 
operating speed On account of ever increasing demand for electric power and high speed 
transportation, the rotating machinery is being made increasingly light and flexible, 
thereby increasing the complexities of its dynamics Any fault in the system results in 
corresponding changes in the behaviour of the system and vibration signatures of the 
machine become important indicators of the health of the system Typical rotating 
machinery installations involve mechanical elements like rigid or flexible shafts carrying 
one or more discs with bladed components The arrangements may be supported in 
rolling element or fluid film bearings with associated components like coupling seais, 
gear-boxes, pulleys, etc The installation is carried by pedestals which are mounted on 
foundations The commonly occurring faults m rotating machinery are cracks on shafts, 
loss of blades on discs deteriorating balance, bows of shafting, shaft misalignment, oil 
film whirls, gear misalignment and damage, bearing eccentricities coupling inaccuracies 
and damage, pedestal and foundation cracks etc 

2.1 Problems in Rotating Machinery 

Rotordynamics has been an area of extensive research during the past two decades 
Reference can be made to the books by Rao(1992), Childs(1993), Dimentberg(1988) for a 
comprehensive discussion m the field, including fault classification and diagnosis Some 
of the problems m the rotating machines can be broadly listed as follows 

Unbalance: 

Unbalance m machinery arises due to eccentricities between geometrical and mass centers 
m components These unbalances generate large centrifugal forces m the machine which 
cause vibrations at the rotor speed e g one cycle per revolution Some specific unbalances 
created during the running of the machine give rise to additional characteristics 
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Distortion (casing and foundation ): 

Casing and foundation distortion cause vibration m an indirect way either by generating 
misalignment between driver and driven machines or by causmg internal rub or uneven 
bearing contact This in turn transmits forces to the rotor, inducing it to generate forces of 
its own. such as unbalance and a wide variety of oil film and friction mduced forces 
Another possibility is that the loads on casing supports shift and can set off a senes of 
resonance problems Piping forces and foundation distortion often cause this type of 
difficulty 

Misalignment: 

Coupling misalignment causes friction or deflection forces m the coupling and this in turn 
causes the rotor and bearing system to deflect and to amplify these forces possibly creating 
secondary phenomenon such as harmonic resonance which can become severe 
Frequencies are characteristically 2 x running frequency (RF) for gear coupling, but 1 x RF 
is also often observed 

Bending Resonance: 

A rotor runs into resonance when the rotational speed is equal to its lateral (bending) 
vibration frequency Bending critical speeds can be easily detected by synchronous whirl 
conditions and fairly large amplitude at the rotor speed The critical speed region occurs 
over a fairly large range m the case of oil film bearings and is often accompanied by 
backward whirl Resonance of then structures, support and auxiliaries cause fanly large 
amplitudes of vibration at the rotor speed 

Asymmetric shaft: 

The response of the asymmetric shaft has several harmonics and the frequencies observed 
are 1 x rev, 2 x rev, 3 x rev and sometimes even harmonics, if the symmetry is 
predominant 

Torsionals: 

Torsional cnticals m machinery are detected by torsional vibration measuring instruments 
Unless there is a gear box m the transmission unit, it is difficult to detect these cnticals by 



vibronieiers or accelerometers used for com entional non-angular vibratory measurements 
Torsional cnticals are predominant in reciprocating madmen,, as there is a large 
excitation torque In rotating machinery, these are not easily excited, unless there is a 
disturbance arising out of gear transmission units Reciprocating machines have several 
cnticals some of them major, between the starting and operating speeds 

Oil-film whirl. 

Instabilities in rotor can also anse due to oil film forces in the bearings This is self excited 
type of motion and is called oil whirl or oil whip The frequency of response at the onset 
of the whirl is always in the region of 0 4 to 0 5 x rev 

Mechanical Looseness: 

Looseness of components is the likely cause for most number of problems Loose rotor 
components such as disks, sleeves, thrust collars etc . cause internal friction problems 
Internal friction is a destabilizing factor The frequency of vibration is always the rotor 
critical speed and hence easy to detect Mechanical loose components like bolts, give rise 
to 1 x rev and harmonic frequency signals due to secondary phenomena The amplitude 
and phase continually change Loose assembly of bearings give nse to subharmomc 
response and the typical frequency response is 1/2 x rev and 1/3 x rev 

Sub-harmonic Resonance: 

This is a subsynchronous phenomenon that occurs at a frequency exactly equal to one- 
third one-fourth, or one-half of the operating speed One of the components of the 
nonlinear system usually vibrates at a natural frequency of the machine system, the 
resulting amplified vibration is termed a subharmomc resonance The condition can be 
corrected in several ways, by changing the natural frequency of the machine system, by 
removing the source of the nonlinear behavior (e g support pedestal, bearing), or by 
reducing the vibratory force that induces the nonlinear behavior 

The most prominent type of rotatmg machinery problems, their symptoms and remedies 
are listed m Table 2 1 (Ehnch, 1992) 
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Table 2.1. Identification and correction of malfunction of rotating machinery 


Fault 

Frequency 

Spectrum, Time-domain, 
Orbit shape 

Correction 

Mass unbalance 

IX 

Distinct IX with much lower 
values of 2X, 3\ etc , 
elliptical and circular orbits 

Field or shop balancing 

Misalignment 

1\, 2X, etc 

Distinct IX with equal or 
higher values of 2\, 3A etc , 

Perform hot and/or cold 
alignment 

Shaft bow 

IX 

Dropout of vibration around 
critical speed in Bode plot 

Heating or peemng to 
straighten rotor 

Steam loading 

IX 

Load sensitive IX 

_ ..... . . _ 

modifv admission sequence 

repair diaphragms install 
nozzle blocks properly 

Bearing wear 

IX, subharmonics, 
orders 

High IX, high 0 5X, 
sometimes 1 5 or orders 
cannot be balanced 

Replace bearing 

Gravity 

excitation 

2X 

0 5 critical speed appears on 
Bode plot 

R1SH9H 

Cracked rotor 

IX, 2X 

High 1X,0 5 critical speeds 
mav show upon coast-down 

Remove rotor 

Looseness 

IX plus large number 
of orders, 0 5X ma> 
show up 

High IX with lower-level 
orders large 0 5 order 

Shim and tighten bolts to 
obtain rigidity 

Coupling 

lockup 

IX, 2X, 3X etc , 

IX with high 2X similar to 
misalignment start and stops 
mav yield different vibration ! 
patterns ! 

Replace coupling or remove 
sludge 

Thermal 

instability 

IX 

IX has varying phase angle 
and amplitude 

Compromise balance or 
remove problem 

Chl-Whirl 

0 35X to 0 47 X 

Subsynchronous component ! 
less than 0 5 order informal 
loop in orbit 

! 

Temporary load bearing 

heavier, correct 
misalignment longterm 
change bearing type 


fel 

Subsynchronous component 
does not change with speed 

Change bearing type 

Subharmomc 

resonance 

0 5X.1/3X, 0 25X, 
and higher 

Subsynchronous Vibration 
depending on natural 
frequency 

Remove looseness, 
excessive flexibility change 
natural frequency so it does 
not match fractional 
frequencies 

Hysteresis 

0 65X to 0 85 X 

High- magnitude fractional 
£requency(greater than 0 5x) 

Eliminate or secure built-up 
parts 

Rubs 

0 25X,1/3X, 0 5X, or 
orders 

External loops in orbit 

Eliminate condition such as 
thermal bow and mass 
unbalance that cause rub 

Trapped Fluid 

0 8X to 0 9X 

Bearing-sum and different 
frequencies 

Remove fluid m rotor if 
possible, otherwise 
eliminate IX component 


lX=One time operating speed fnl=First natural frequency 
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2.2 Analysts and Diagnosis Of Rotating Machinery 

Experimental and analytical techniques, both have been routinely used for diagnosing 
faults m operational rotating machinery These techniques are also used during the design 
and development processes Vibration analysis for fault diagnosis of large rotating 
machinery is finding increased application due to increased requirement of electric power 
and efficient transmission of power between shafts m power plants Parameter 
identification techniques also find wide use to quantify- natural frequencies, mode shapes, 
damping stability and frequency-response characteristics Some of these diagnostic 
techniques are listed m Table 2 2 


Table 2.2. Diagnostic Techniques for Rotating Machinery 


iBHu&SEEIISSHi 

Use 

Description 


Time-domain 

analysis 

Modulation, Pulses, Phase, 
Truncation glitch 

Amplitude versus time 

Analog and digital 
oscilloscope, FFT 
spectrum analyzer 

Orbital analysis 

Shaft motion 
subsynchronous whirl 

Relative displacement of 
rotor bearing inXY 
directions 

Digital vector 
filter. Oscilloscope 

Spectrum analysis 

Direct frequencies, natural 
frequencies, sidebands, beats, 
subharmonics, sum and 
difference frequencies 

Amplitude versus 
frequency 

FFT spectrum 
analyzer 

Cepstrum analysis 

Sideband and harmonic 
frequency measurement, 
accurate quantification of 
sideband and harmonic 
seventy 

Inverse Founer 
transform of logarithmic 
power spectrum 

FFT spectrum 
analyzer 


2.2.1 Tune Domain Or Wave Form Analysis: 

Time domain signal is an unprocessed record of the vibration as picked up from a 
transducer It consists of information about the physical behavior of machine A senous 
limitation is that it can be too complex for analysis if excessive noise, signal modification 
or several frequencies are present The processing can be done, but it should be minimized 
whenever possible so that errors are not mtroduced mto the data In the time domain, 
frequency and phasing are two important factors that are to be evaluated for fault 
diagnosis Valuable information is provided by signal shape, le truncation, pulses, 
modulation, glitch or shaft-induced signals obtained from a proximity probe that are 
caused by scratches on the shaft Time between events represents the frequency 


- 7 - 
















components within the machine The phase between two signals provides information 
about vibratory behavior that can be used to diagnose a fault such as misalignment 

2.2.2 Orbital Amah sis: 

The horizontal and vertical motions of the rotor with respect to a sensor mounted on the 
bearing are simultaneously displayed on oscilloscope This is called the orbit plot The 
orbit represents the instantaneous position of the rotor The orbit can be used to access oil 
whirl and other asynchronous motions as well as synchronous phenomena such as mass 
unbalance and misalignment Orbital analysis is possibly the best way to distinguish 
between mass unbalance and misalignment, both of which cause once-per revolution force 

2.2.3 Spectrum Analysis: 

Spectrum analysis is conducted with an FFT algorithm or filters The processed signal 
gives information about the amplitude and phase content at various frequencies 
Frequencies of vibration response can be related to direct excitation frequencies or then- 
orders, natural frequencies, sidebands, subharmoxucs and sum-difference frequencies 
Frequencies can be identified directly in a linear system, because the frequencies of the 
measured vibration response are equal to those of the forces causing the vibration When a 
minimum number of frequencies are present, signal-resolution is required, and electric 
filters can be used to analyze the spectrum. The operating speed of the machine is usually 
the fundamental frequency used in the analysis The frequency at operating speed is 
identified and then linked to other frequencies in the spectrum such as direct orders 
(multiple) of operating speed These multiple orders are due to a variety of faults that are 
present m the rotating machinery The exact fault can be predicted by comparing with 
good test data that is directly proportional to the information available on the design of a 
machine and its working mechanisms This is especially true when similar frequencies are 
obtained for different faults, e g mass unbalance and misalignment 
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2.2.4 Cepstrum analysts: 

Cepstrum is defined as the inverse Fourier transform of the logarithmic power spectrum 
commonh used in vibration analysis The complex cepstrum is reversible to a time signal 
The terminology used m cepstrum analysis is analogous to that used in spectrum. A 
cepstrum domain consists of data processed one step further than the frequency domain A 
cepstrum is effective for accurately' measuring frequency spacing, harmonic and sideband 
patterns m the power spectrum One component of the cepstrum represents the global 
power content of an entire famil> of harmonics or sidebands The seventy of a defect such 
as that in a rolling element bearing is provided by one component of the cepstrum display 

2.2.5 Expert Systems For Fault Diagnosis 

Conventional manual trend checking methods and the above discussed diagnosing 
procedures have then - own stipulations and often require human expens to analyze the 
vibration signature, which are expensrv e and do not yield the exact prediction of faults 
This has motivated the dev elopment of Expert systems or Knowledge based systems for 
accurate fault diagnosis of rotor dynamic installations, which are cost effective and time 
saving Expert systems are intelligent computer programs that are based on w ; ell defined 
algorithms which exhibit within a specific domain, a degree of expertise as a prediction 
system, diagnostic system, design system, planning system, speech recognition system 
monitoring system etc These are often considered to be one of the most popular 
application of artificial intelligence (AI) Artificial neural networks (ANN) are often 
thought of as distinct from the Al, though the ANNs are common in AI literature ANNs 
simulate the biological process of human bram and nervous system whereas AI programs 
attempt to identify and execute higher level process and relationship that are often based 
on logic and linguistics 

Knowledge base is the most important part of any expert system It is stored m the form of 
facts and rules The facts are termed as deep knowledge and rules are simply the 
heuristics The knowledge that is stored is controlled by an inference engine The inference 
engine interacts with the user and uses the facts in the knowledge base according to the 
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rules contained in it Since the knowledge or data m most cases may be incomplete or 
uncertain the models employ probabilistic reasoning technique such as Bayes’s rule fuzzy 
logic Dempster- Shafer calculus etc Reference can be made to the work ofNema (1994), 
who employed probabilistic reasoning techmques for the development of an Expert Shell 
for diagnosis of rotatmg machinery' ANNs are increasingly becoming popular due to their 
capabilities of organizing and identifying uncertain patterns This feature of the ANN, 
motivates the researchers to diagnose the various malfunctions of the machinery' 

2.3 Remarks 

In the present study an ANN is developed to diagnose the rotating machinery' 
malfunctions The knowledge is acquired through standard learning techniques Each 
segment of the acquired knowledge gets a weightage which is related to its importance 
These weights derrv e a functional relation between input and output patterns, during the 
learning process 

The relevant features of Artificial Neural Networks are described in the next chapter 
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3. FEATURES OF ARTIFICIAL NEURAL NETWORKS 


An Artificial Neural Network can be defined as a massively parallel distributed processor 
that has a natural propensity for storing experiential knowledge and making it available for 
use It resembles the brain in two respects ( 1 ) knowledge is acquired by the network 
through a learning process and (n) inter-neuron connection strengths, known as synaptic 
weights are used to store the knowledge 

Artificial Neural Networks technology is of a relatively recent origin and very few studies 
are a\ ailable which relate to vibrations and dynamics of machinery Most of the work has 
been earned out m the present decade Mayes (1994) appbed ANN for on-line vibration 
monitoring of large turbo-generators The investigations focused on data processing and 
the use of the neural networks were discussed Elkordy, Chang and Lee (1994) 
investigated the applicability of ANN for vibration signature analysis of a five-storey steel 
structure The primary investigations showed that ANNs have considerable potential to 
assess structural damage Trzynadlovvski, Karam. Gandikota and Ghassemzadeh (1994) 
applied the neural net software for the validation and recovery' of the distorted vibration of 
electromachine systems Vibration signals, which represent the lateral displacement of a 
shaft, w'ere obtained from proximity probes and the Discrete Founer coefficients were 
calculated These were used as the inputs to the network to classify 7 fault and no-fault 
signal Fuzzy-input neural net adaptrve expert systems were presented by Dimargonas 
(1995) for rotor diagnosis and prognosis A traditional binary classification system was 
adopted to diagnose the fault in rotating machinery' The two possible states, fault and no- 
fault were described by 1 or 0 Available knowledge for failure diagnosis in 
turbomachinery was utilized to initially teach the system Ahn and Cho (1996) proposed a 
new 7 vibration control scheme using the ANN along with electromagnetic and pneumatic 
principles as a hybrid type active vibration isolation system. The characteristics were 
investigated via computer simulation as well as experimentation The proposed control 
scheme could suppress the transmissibihty of the vibration isolation system to below 0 63 
over the entire frequency range, including the resonance frequency, without complex 



calculation or prior manipulation Haung and Lian (1996) proposed a different hybrid 
scheme v,ith a combination of fuzz} logic and neural network algorithms for active 
vibration control A fuzzy logic controller was designed for controlling the mam 
influence part of the MEMO (multi-input/multi-output) system Very recentl}, Han Xiu 
Wang Chen and Tan (1997 j presented the adaptation of neural network for fast-valve 
controlling of a power generation plant The backpropagation neural network was used to 
train the feed-forward neural network controller The computer simulated 
backpropagation network results were compared with the conventional fast \alving 
methods applied to the same system The investigations proved that the ANN controller, 
has satisfactory generalization capability reliability and accuracy for the critical control 
operation McCormick and Nandi (1997) contemplated the application of ANN for real- 
time classification of rotating shaft conditions The vibration signals were collected from 
the machinery in time-domain Simple signal processing techniques were applied to 
prepare the training vector set The use of the ANN was described to classify the load and 
no-load of the rotating machinery' The network results were compared with the frequency 
domain analysis 

The salient features of Artificial Neural Networks are described m this chapter 

3.1 Model Of A Neuron 

A Neuron is an information processing unit that is fundamental to the operation of a 
neural network Fig 3 1 shows the model of a Neuron The three basic elements of the 
neuron model are described below (Haykin, 1994) 

A Set of Synapses 

Synapses are connecting links, each of which is characterized by a weight or strength of 
its own It specifies the connection between a signal x } at the input of the sample j, and a 
Neuron k The weighting factor is w kj 
An Adder 

Adder sums up the input signals weighted by the respective synapses of the Neuron The 
operation is similar to that of a linear combiner 



An Actn ation Function 


Activation Function limits the amplitude of the output of the Neuron The activation 
function is also referred to as a squashing function 



weights 


Output 

'ft 


Fig 3 1 Nonlinear Model Of Neuron 

As shown in the above figure (Fig 3 1 ), the model of a Neuron also includes an externally 
applied threshold function 9 k . that has the effect of lowering the net input of the 
activation function On the other hand, the net input of the activation function may be 
increased by employing a bias term rather than a threshold The bias is the negative of the 
threshold In mathematical terms a neuron k is described by 

p 

u * = > y\=<p ("* - 0 * ) = <p( v k ) ( 3 !) 

where x p are the input signals. w 41 ,w 42 are the synaptic weights of 

the neuron 1 is the linear combiner output, 6 K is the threshold, <p( ) is the activation 
function and is the output signal of the neuron v k is the output from the neuron after 
summation 

Equations (3 1) can be equivalently written as 


p 


< 

II 

M 

* 

Jr 

(3 2) 

J* o 


T* = 9( v k ) 

(3 3) 


_T 'J- 




In equation (3 2) a new synapse has been added, whose input and weight, respectiveh are 
x ( =-l w* o =0* (3 4) 

and therefore the model of the Neuron k can be reformulated as m Fig 3 2 
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Fig 3 2 Modified Nonlinear Model Of Neuron 

3.2 Activation Function 

The activatrve function denoted by <p( ) defines the output of a Neuron m terms of the 
activity level at its mput The sigmoidal function is by far the most common form of 
activation function used in the construction of the artificial neural networks It is defined 
as a strictly increasing function that exhibits smoothness and asymptotic properties An 
example of the sigmoid is the logistic function defined by (Fig 3 3) 

?( v ) = T- — 7 7 ( 35 > 

1 + exp(-m’) 

where a is the slope parameter of the sigmoid function 
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Fig 3 3 Sigmoid Function 

3.3 Network Architectures 

The manner m which the neurons m a neural network are structured is intimately linked 
with the learning algorithm used to tram the network Four general classes of network 
architectures are 

i) Single layer feed forward network 

This is a network of neurons organized in the form of layers The simplest form of a 
layered network consists of an mput layer of source nodes, that projects on to an output 
layer of neurons but not vice-versa This network is of feed-forward type (Fig 3 4) 
n) Multi-layer feed forward network 

The second class of feed-forw'ard network distinguishes itself by the presence of one or 
more hidden layer The function of hidden neurons is to intervene between the external 
mput and the network output (Fig 3 5) 
m) Recurrent networks 

A recurrent network distinguishes itself from the feed-forward neural network m that it 
has at least one feedback loop For example, a recurrent network may consist of single 
layer of neurons, with each neuron feeding its output signal back to the mput of all other 
neurons (Fig 3 6) 

\i) Lattice structure 

A lattice consists of a one-dimensional, two-dimensional or higher dimensional array of 
neurons with a corresponding set of source nodes that supply the mput signals to the 
arrays (Fig 3 7) The dimension of the lattice refers to the number of dimensions in which 
the graph lies 






Lnpu layer Output Layer 

of source of neurons 

nodes 

Fig 3 4 Single layer Feed forward network 


Inoui {aver 
of source 
nodes 

Fig 3 5 


Lavr of Lave of 

hidde" output 

neurons neurors 

Feed forward network with 
one hidden layer 




Fig 3 6 Recurrent network Fig 3 7 One-dimensional lattice of 

3 neurons 

3.4 Learning Or Training Algorithms 

Most training techniques, for ANNs have been developed from common roots and share 
man y characteristics A brief descnption of some of them is grven below 
Supervised And Unsupervised Learning 

Tr ainin g algo rithms are classified as supervised or unsupervised In supervised training, 
there is a teacher that presents input pattern to the network, compares the resulting output 
with those desired, and then adjusts the network weights m such a way as to reduce the 
difference However, in unsupervised training there is no necessity for the teacher Input 
patterns are applied, and the network self-orgamzes by adjusting its weights according to 
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well-defined algorithms Most of the today’s tra inin g algori thms have evolved from the 
concepts of Hebb (1961), winch involve adjusting the network weights according to the 
product of the excitation level of the source and destination neurons Mathematically. 

M g (t + l) = M ii (t) + Tput l out J (3 6) 

where, u s (/) and w p (t + l) are the weights from neuron i to neuron j, prior and after 
adjustment, respectively rj is the learning-rate coefficient , out t is the output of neuron i 
and mput to neuron j, while out J is the output of neuron j 

3.5 Backpropagation Algorithm (BP A) 

BPA is found to be the most common learning algorithm. It has been tested with a number 
of different problems and has been found to perform well m most cases It is designed to 
solve the problems of choosing weight values for layered artificial neural networks with 
feed- forward connections from mput layer to the hidden layer and then to the output layer 
The BPA performs the mput to output mapping by minimizing a cost function using a 
gradient search technique The cost function (which is equal to the mean squared 
difference between the desired and the actual net output) is minimized by making wide 
connection adjustments according to the error between the computed and target output 
processing element values There are two stages m the development of a backpropagation 
algorithm - namely forward pass and backward pass During the forward pass all the 
weights of the network are initialized randomly and the network outputs and the 
difference between the actual and target output (i.e error) is calculated for the initialized 
weights During the backward step, the initialized weights are adjusted to minimize the 
error by propagatmg the error backwards The network outputs and error are calculated 
again with the updated weights and the process repeats till the error is acceptably small 
The two steps are described below (Fig 3 8) 



Input La\ er Hidden La\er Output La) er Target 



Bias Unit Bias Unit 

Fig 3 8 The three la\er Back propagation network Architecture 


3.5.1 Forward Pass 

The input vector p to the network is x p = (x P \ x pS )' 
where x p represents the input attribute i for the vector p 
The net input values to the hidden layer units are 



UjiM 12 

W 'l„ 

net h = V w h x +6 h 

& l—J J 1 P 1 J 

1=1 

_ ^21^22 






2 " 


r 2 \ + \ "l 


Wj, represents the w'eight of the layer h from node / node to node j and 6 h L represents the 
threshold for the node L of the layer h 

The outputs from the hidden layer( which is input to the output layer) are 

=/><) ( 39 > 

o* is the output from the node j of the hidden layer h,ff is the activation function at 

node j of the hidden layer h 
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The equations (3 8) (3 9) are represented graphical]) in Fig 3 9 
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Fig 3 9 Output calculation at each node in the forward pass 


In the above the net-input values at output layer unit are 




(3 

10) 

the outputs at output layer units are 0°^ = f k ° ( net ^ ) 

(3 

H) 

the individual error at each output unit is 8 ^ = y ^ 

(3 

12) 

1 A 2 

and the mean squared error at all output unit is E p = — 2_) 8 ^ 

2 

(3 

13) 


3.5.2 Backward Pass 

The weights at the output and hidden layer are adjusted to min im ize the mean square error 
m the backward pass While the output layer nodes are associated with the target output 
and the nodes at the hidden layer are not, the weights updating at the output and hidden 
nodes is different These two procedures are described below 
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Updating The Output Laver Weights 

From equation (3 12) and (3 13) E p can be expressed as 

= < 3 14 > 


The weight change of an output layer weight is the negative gradient of E p with respect 
to output layer weights h £ 


t o v 

— = -c>v “ >■ 


dw 


* dimt 0 ^) 0*1 


However, from equation (3 10) 


3*; 


-y<; 

ci. o L~i tg pi * 

J 


- I 


VS 


and from equation (3 11) 

Therefore, the weight change at the output layer weight is 

A > = = (y# ~ °U )// >» 

Now denoting 

the weight change at the output layer weights can be written as 


A> = <r./ 

Z> Z* 


(3 15) 

(3 16) 

(3 17) 

(3 18) 

(3 19) 

(3 20) 


In practice, instead of directly applying the above weight change to the weight at the 
output layer, two network parameters are introduced, namely. Learning Rate Coefficient 
7 ] and Momentum a , m order to make the learning progress smooth and to ensure that 
the weight changes take place always m the same direction 

wl(t + l) = wl(t)+7T{y pk -o 0 pk )f k °'(net° pk )i B +awl(i- 1) (3 21) 

The selection of the learning rate parameter rj , has a significant effect on the network 
performance Usually, tj is a small number, of the order of 0 05 to 0 9 A small value of r\ 
implies that the network will have to make a large number of iterations It is often possible 



to increase the \alue of tj as network error decreases therein increasing the speed of 
comergence towards the target output The other wav to increase the convergence speed 
is b\ adopting an extra momentum term a (equal to a fraction of the previous weight 
change A p w ) while updating the weights This additional term tends to keep the weight 

changes m the same direction The entire weight updating process at output layer can be 
represented graphical!} by Fig 3 10 


Hidden Laver Output Laver Target 



Fig 3 10 Output layer weight updating 
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Updating The Hidden Laver Weights 

The adjustment of weights for the hidden layer nodes is proportional to their individual 
initial contributions there is no target output as m the case of the output layer From 
equations (3 10) and (3 11) 




(3 22) 


The weight change of a hidden layer weights is the negatrv e gradient of E p with respect 
to hidden layer weights w * 


,h 


= y/ v N ^ 

cui v 


k 
' Ji 


dinet 0 ^) a B d{net m ) at' 

From the equations (3 10) to (3 14) the individual terms can be expanded as 

4»<;) ^ ^ 
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av\ 




= x. 


Substitutmg the above equations (3 24 to 3 27) m equation (3 23) 


* & 

A w, = - 

p J> fa 




(3 23) 


(3 24) 


(3 25) 


(3 26) 


(3 27) 


(3 28) 


Network parameters 77 , or can be introduced in a manner similar to that in the case of the 
output layer, to express the final weight change at the hidden layer as 

K ( t+l )= w j. (0 + ( l - °l )Z (>v + K(^-i) ( 3 29 > 
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Tlie entne weight updating process at hidden laser is represented graphical!} m Fig3 1 1 

Input Layer Hidden Laser Output laser 



Fig 3 11 Hidden layer sveight updating 







3.6 Remarks 

The ANN algorithm developed m this stud) emplojs the backpropagation learning 
technique The training is done with processed data until adequate convergence of the 
network was achieved and the stored weights are used to estimate the output patterns 
corresponding to the given mput patterns, during pattern recognition The algorithm is 
written m ANS1-C 
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4. NETWORK DESIGN FOR 
ROTORDYNAMIC APPLICATIONS 

An effective machine diagnosis requires that the diagnosis algorithm is trained for aD the 
fault conditions that need to be identified A\ a liability of such training data, and generation 
of such data in cases where it is not readily available is the primary requirement of any 
diagnosis scheme Separate data sets which are not part of the training sets are required 
for validation and testing of the network The training and the test data, during the present 
study were generated on a laboratory' rotor-ng The rig. data generation, training and 
testing of the algorithm are described m this chapter 

4.1 The Rotor Rig 

The Rig (Fig 4 1) consists of a 10 mm diameter shaft carrying a centrally located steel 
disc weighing 0 5 kg It is supported in identical rolling element bearings (type 6200 SKF 
ball bearings) at two ends and dm en by an 50 Watts, 230 Volts AC/DC electric motor 
through a flexible spider coupling The following faults were deliberately introduced m the 
ng for generatmg training data 

• Rotor with no-fault 

The rotor was balanced, the alignments and fittings were done properly so as to 
presumably classify' it as a system with no-fault 

• Rotor with mass unbalance 

A mass of about 0 05 Kg was added at radius of 25 mm on the rotor disc anc 
unbalance was created 

• Rotor with bearing cap loose 

The cap on the bearing block was loosened so as to create a gap of approximately 1 
mm between the outer race of the bearing and the cap of the bearing block N< 
attempt was made to quantify the fault accurately 
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• Rotor with misalignment 

Misalignment in the rotor was created by shifting the bearing block sideways by about 
3 mm, so that the axis of the two bearing blocks were out of alignment by about 3 
mm. Again no attempt w'as made to quantify’ this fault in strict numerical terms. 

• Play in spider coupling 

The flexible rubber spider in the coupling was removed and a small cut was introduced 
such that a radial clearance was created at the two halves of the coupling at the outer 
diameter of 15 mm. This fault was also not quantified in numerical terms. 

• Rotor with both mass imbalance and misalignment 

In this case mass unbalance and misalignment were both introduced simultaneously in 
the rig. 



r 


Fig 4. 1 . Laboratory arrangement of rotor rig 

These faults were introduced one at a time and the vibration signals of the rotor wer 
picked up from the bearing-block caps by piezo-electric accelerometers (Burel and Kja< 
make, charge sensitivity = 10.20 pc/m/s 2 ), for a range of rotor speeds from 500 to 90 
rpm. The signals were amplified by charge amplifiers (Make 2635, Bruel and Kjaer). If 
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amplified signals were stored in floppy diskettes using a portable dual channel FFT 
anaJ\zer(Ono-Sohhi) for further processing Typical rotor vibrations sensed at the bearing 
cap. b> the accelerometer for each of the six faults introduced in the rotor, are shown in 
Figures (4 2) to (4 7) The signals pertain to a typical rotor speed of 800 RPM The FFT 
of the time signal is also shown m these figures Twenty such vibration signals were taken 
for each of the faults introduced at e\ery speed of rotation Signals were obtained for five 
rotor speeds in the range 500-900 RPM These signals were processed further, to form 
mputs to the training algorithm. 

4.2 Preparation Of The Training Vector Set 

It is evident from the FFT of Figs 4 2(b) to 4 7(b). that there can be a large number of 
inputs (say if amplitude and phase information is used as training data, they need to be 
provided for a reasonably large number of frequencies, to retain all the relev ant features of 
the signal) Other high resolution methods, such as autoregressn e analysis, time frequency 
distribution methods e g wavelet transformation, also generate a large number of mputs 
In order to reduce the number of mputs to the network it is necessary to extract 
stationary features of the vibration signals An efficient way to reduce the number oJ 
mputs, while retaining most of the relevant information, is to extract time-invanani 
features from the stationary time senes of Figs 4 2(a) to 4 7(a) This is easily done bj 
estimating the zero-lag moments (McCormick and Nandi 1997) Smce moments of tuns 
senes characterize the probability density function p(x) of the vibration signal for eacl 
condition, the following d iff erent moments are calculated for the complex time sene 
given by 

z(t) = a h {t)+jaSt) (4 1) 

and differential complex time senes given by 

dz(t) = — *z(t)-x(t-l) (4 2) 

dt 

where a h (t) is the vibration signal from the bearing m the horizontal direction and a, (t 
is the signal m the vertical direction 
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Fig 4 2 (a) Vibration signal in time domain Fig 4 2 (b) Vibration signal in Frequency dom: 

at 800 RPM for No-fa u it at goo RFM for No-fa nit No-fault 



Fig 4.3 (a) Vibration signal in time domain 
at 800 RPM for Play-in-couphng 


Fig 4.3 (b) Vibration signal in Frequency domain 
at 800 RPM for Play-in-coupling 
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Fig 4 6 (a) Vibration signal in time domain 
at 800 RPM for Misalignment 
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1 Fig 4.7 (a) Vibration signal in time domain 
at 800 RPM for Mass unbalance 
and Misalignment 
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ig 4 6 (b) Vibration signal m Frequency domain 


at 800 RPM for Misalignment 
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Fig 4.7 (b) Vibration signal in Frequency domau 
at 800 RPM for Mass unbalance and 
Misalignment 
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The mean of the vibration signal is the First Order Moment and is expressed as 

a c 

E{z{t)} = T)= \;(t)p(x)d, (4 3) 

— oc 

Two types of Second and Higher Order Moments are taken as follows 

OC 

Central moments = E{(z(t)~ T]) k } = \{z(t)~ T])* p(x)d x k = 1,2,3 (44) 

or 

Cross moments (Cross Correlation) r n = - 1) / = 0,±1,±2 (4 5) 

r=-cr 

where, E( ) is the expectation operator and / is the time-lag 

The mean second to fifth central moments of the senes \:{t)\,\cb(t)\ are calculated at 
vanous speeds for all faults mtroduced The moments thus computed for all the faults 
introduced are shown as functions of operating speed m Figs 4 8 to 4 10 



Fig 4 8 (a) Statistical Moments Of |r(/)j 
No-fault 


Fig 4 8 (b) Statistical Moments Of |r(/)| 
Coupling loose 
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Statistical Moments (dB) 



Rotor Speed (RPM) Rotor Speed (RPM) 


Fig 4 8 (c) Statistical Moments Of |r(f)i 
Mass unbalance 


Fig 4 8 (d) Statistical Moments Of \={t ) I 
Bearing cap loose 



Fig 4 8 (e) Statistical Moments Or jr(/)| 
Misalignment 



Rotor Speed (RPM) 


Fig 4.8 (f) Statistical Moments Of |r (t )( 

Mass unbalance And Misalignment 
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Statistical Moments (dB) Statistical M 
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Statistical Cross Moments (dB) Statistical 



Fig 4 9 (e) Statistical Moments Of |<fc(f)| 
Misalignment 



Fig 4.104a) Statistical cross Moments 
No-fault 



Fig 4 940 Statistical Moments Of \dzit] 

Mass imbalance And Misalignment 



Fig 4.104b) Statistical cross Moments 
Coupling loose 
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Fig 4.10 (c) Statistical cross Moments 
Mass unbalance 


Fig 4.10 (d) Statistical Moments 
Bearing cap loose 
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Fig 4.104e) Statistical cross Moments 
Misalignment 


Fig 4.10.(f) Statistical cross Moments 

Mass unbalance And Misalignment 
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4.3 Network Structure 

The Neural Network Algorithm is developed for a feed-forward type architecture 'with one 
or more hidden layers The architecture of such a network is referred m terms of n um erals 
as ( i,jJ,j2 , jn,k) where i denotes the number of neurons m the input layer jn is the 
number of neurons in the n-th hidden layer and A denotes the number of output neurons 

The moments of the time senes defined above form the training attributes to the network 
They are normalized between 0 1 and 0 9 before feeding them to the network The target 
output, for training, is made to be a five-dimensional vector The dimension of the target 
output vector is equal to the number of primary faults for which the data is available 
Since, five types of primary faults were introduced in the laboratory ng the dimension of 
the target output vector was made to be five The sixth fault is a combination of mass 
unbalance and misalignment The target output vector is chosen to have a value equal to 
either 1 or 0, m each dimension A value equal to 1 in a dimension is symbolic of the 
presence of a particular fault, while a value equal to zero, signifies its absence 

Tra inin g is earned out with the following sets of data - 

1 Central Moments of |z(?)| 

2 Central Moments of \dz(t )| 

3 Combined central moments of |z(f)j and 

4 Cross moments of the time senes z(7) 

The Table 4 1 desenbes the training scheme with the central moments of \z(t)\ 
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Table 4.1 Training data 


No 

Input Vector 

Out 

put Vector 


ml 

m2 

m3 

m4 



COT 

MU 



1 

3 825 

5 497 

50 216 

201 730 



0 000 

1 000 

0 000 

iisi 


EIS3I 

35 538 

407 055 

3702 857 

40905 828 

wm 

1 000 


SSI 


mm 

MZM 



5 597 

19 738 

■m 

0 000 

imsi 

MSI 

wmm 

4 


ilfl 


163 432 

974 312 

HiVtiiiil 

B885M 

MTBHil 

1>8M»1 

gjjgHi 

5 

1 348 

■ 

7 460 

22 315 

91 944 

0 000 


EB 

■ainw 

mm 

6 

3 144 

Kzzm 

37 044 

150 688 

858 668 

1 000 

ebb 

B888551 

m 


7 


WSEM 

msmm 

4 753 

17 142 

0 000 


■1.T.T11 


UiMtl 

8 


5 196 

mm 

150 329 

866 723 

0 000 



m 


9 

WSEM 

0 695 


7 054 

26 745 

0 000 

0 000 


1 000 

0 000 

10 

mxm 

7 322 

71 231 

329 811 

2471 105 


0 000 

BilifflM 

0 000 

1 000 

mm 

4 063 

7 876 


338 588 j 

2596 214 

0 000 

0 000 



1 000 

mm 

BBS 



182 959 

1142 190 

0 000 

1 000 

0 000 

0 000 

0 000 

13 


mmm 


173 109 



■niimi 

1 000 


1 000 

14 

EEH 


62 638 

267 333 



mum 

IBiBBl 


0 000 

mm 



70 051 

251 523 



mm 


BB 

0 000 

m 

4 174 

7 779 

68 666 

334 536 

2387 193 

pfiW«1 

n 

0 000 

0 000 

1 000 

mm 


1 1SM 

24 412 

120 046 

716 274 


MEM 


1 000 

eeh 

18 


BttkX 

EfflyfE 

253 583 

1866 117 

E%y?i 


188551 

0 000 

DEB 

19 


5 560 


IE%£E5fE 

1369 507 


BH 

H551 

B&X 

mm 

20 

3 605 

5 023 


Em 

1082 105 


0 000 

ipigwi 

BUSl 

m 


ml-m5 

moments 



NF 

No~fault 

cou 

Coupling loose MU Mass unbalance 

BCL 

Bearing cap loose 

MA 

Misalignment 


Six hundred such data are used for training the network Initially all the weights in the 
network are adjusted between -0 5 and 0 5 randomly Once the learning process starts, the 
neural network is so designed that the weights and the thresholds between different layers 
adjust automatically, so as to minimiz e the mean square error, between the actual network 
output and the targeted output In minimizin g the error, the other network parameters like 
moment, learning rate, number of hidden layers and nodes m each layer are adjusted The 
output vectors which are generated by the network do not always consists of exactly 
either 0 or 1 If a dimension value is greater than or equal to 0 6 it is considered as 1 
Similarly, if it is less than or equal to 0 3 it is considered as 0 If the vector consists of a 
quantity less than 0 6 in all dimension the prediction of fault is ambiguous The network 
algorithm is trained for different architectures and training schemes 
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4.4. Network Training With First Five Moments Of \z{t)\ 

The training process was started with the mean and the second to fifth central moments of 
\z(t)\ The network needs to (5jJ,j 2, jn, 5) architecture - since there are five mput 

nodes, each node corresponding to a moment mput, and an output layer consisting of 5 
nodes, each node corresponding to one of the five primary faults under consideration The 
number of hidden layers can be varied The number of neurons m each hidden layer can 
also be varied A total of 600 samples, comprising of 100 samples for each fault, was used 
for the training the network A numerical value is required initially, to quantitatively define 
the accuracy of the training process A training process will be successful if it is able to 
correctly identify the fault The degree of such accuracy, called the target error, is the 
mean square difference between a target value (0 or 1) and the achieved output To start 
with, the achievable accuracy was arbitrarily chosen as 0 01 A network with two hidden 
layers, each with five neurons (network architecture 5, 5, 5, 5) was tried The convergence 
to the value of 0 01 was not achieved even after 2000 iterations (see the error graph of 
Fig 4 11) The number of hidden layers and the number of neurons in each layer was 
varied The mean square error achievable was in the vicinity of 0 7 When the network, 
thus tramed (to a mean square error accuracy of 0 7), was given a fresh set of mputs, 
obtained from various faulty vibration signatures (faults being unknown to the network, 
since these mput sets did not form a part of the training data), it was found that the 
network had failed to correctly identify the faults m a significant way The testing success 
was only between 35 to 45% The performance of the various network architectures (for 
learning rate 0 2 and moment 0 8) is shown in Table 4 2 (about 15 such architectures were 
tried, five best are given in the table) The general observation was that the overall 

performance of the network with first five moments of the \z{t )| was between 35 to 45% 
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Fig 4 1 ] Extor graph, for the network with 5 moments 


Table 4.2. Different network architectures and testing success 
for the network with first five moments of \z(t)\ 


No 

Network 

Architecture 

Target 

Error 

Achieved 

Error 

Iterations 

Testing 

Success 

1 

5,5, 5,5 

■GEJH 


2000 

35 0% 

2 

5, 8,8,5 


■EBEm 

2000 

42 5% 

3 

5,6,6, 5 

0 01 

WtSEBSKk 

2000 

41 0°/o 

4 

5,10,5 

0 01 

■EEES9 1 

■Hm 


5 

5 ,6,6,5 

0 01 

0 7043 

2000 



4.5. Network Training With First Five Moments Of \dz(t)\ 

The training was earned in a manner similar to that adopted for the moments of z(t ) The 
mean, second to fifth central moment of |dfe(t)| were used as mputs The training was 
initially earned out with a set of 600 samples and a target error of 0 01 Convergence 
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could not be achieved in this case also The error was never below 0 6 for all architectures 
that were trained and tested The success rate was 50 to 60% The different network 
architectures and testing success are shown in Table 4 3 (about 15 such architectures were 
tried, five best are given in the table) 


Table 4.3. Different network architectures and testing success 


for the network with first five moments of |<iz(t)| 


No 

Network 

Architecture 

Target 

Error 

Achieved 

Error 

Iterations 

Testing 

Success 

1 

5,5,5 

0 01 

0 7613 

2000 

50 0% 

2 

5,10,5 

0 01 

0 7109 

2000 

52 5% 

3 

5,10,5 

0 01 

0 7004 

2000 

57 5% 

4 

5, 8,8,5 

0 01 

0 7426 

2000 

57 5% 

5 

5,6, 6,5 

0 01 

0 7446 

2000 

60 0% 


4.6. Network Training With Moments Of both \z(t)\ \cb(t)\ 

An attempt was made to improve the convergence of network by performing the training 
with the moments of jz(t)j and \z(t)\ simultaneously The training w'as earned out for a 

(10,10,5) architecture (the input layer now has 10 neurons since there are 10 input 
vectors) with a target error of 0 01 It was observed that the mean square decreased to 

about 0 46 which was lower than the error obtained with moments of |z(?)| alone or 
|z(7)| alone On testing the network thus trained it was found that it was able to correctly 
identify the faults in about 60% of the cases Table 4 4 gives the results obtained for such 
tests 


Table 4.4. Performance of (10,10,5) architecture 


No 

Network 

Architecture 

Moment 

Learning 

Rate 

Target 

Error 

Achieved 

Error 

Iteration 

s 

Testing 

Success 

m 

10,10,5 


0 10 

0 01 

IIEE&OQBi 

2000 

60 00% 


10.10.5 



0 01 



61 25% 

n 

■H3ESNI 

0 80 

0 10 

0 01 

0 4055 

2000 

65 00% 



0 90 

0 20 

0 01 1 

0 4628 

2000 

65 00% 

d= 

■bushmi 

0 80 

0 25 

0 01 

0 4823 

2000 

68 67% 
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Since the results were encouraging an additional hidden layer with 10 neurons was 
introduced The architecture now was (10,10 10,5) The performance of this architecture 
was noted for vanous learning rates Fig 4 12 graphically shows the decreasing error 
trends for (10,10,10,5) architecture for two different learning rates A learning rate of 0 25 
generated the highest success of 83% The performance of this training scheme is given m 
Table 4 5 



Nn nf iterations 





Table 4.5. Comparison of network output with actual output 
for the network with 10 inputs 


No 
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work on 

itput 


Actual Output 



cou 

MU 

BCL 

MA 

NF 
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MU 
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MA 
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— 
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w§m 


6 






EM 

■EM 

■Cm 

ife® 


7 

0 000 

1 000 

0 000 


| 

mum 


0 000 

E 


8 

0 006 

0 003 

0 640 

0 004 

■still 

ESI 

0 000 

0 000 


Ifffll 


0 953 

0 003 

0 048 

0 001 

| 




E 

a 

mm 

■new 

mmm 

HiaW 

WBlmm 

TUtliM 

■esi 


— ■ 


a 

Ui_ 

MW 

nm 



■ilililiM 


P|f^H 

0 000 

wmm 

EEM 

mm 

litliti 

0 002 


MEM 



11231 


1.I.T.T.1 

0 000 

mm 

0 030 

0 002 

0 663 

mm 


■HI 


ibm 


1 000 

KOI 


■Mil 

■EM 


1 

■raj 


H131 

1 000 

0 000 



IB 

| 




mm 

mm 

0 000 

1 000 

U6 

0 000 

0 002 


buliM 

0 513 

0 000 

■ 

iiiiiiii 


mill 

mm 

0 000 

1 000 




0 000 

i 

0 000 

mmm 

ipua 

mm 

0 081 

0 002 


KOI 

IiW 

■EM 

■em 

1 000 



na 


0 000 

0 944 

| 


liWtW 


1 000 

EE21 


EB1 

0 034 



■ 

0 041 



0 000 

0 000 

o 

o 

o 

_ 

El 

0 006 

0 901 

0 080 

| 

HI 


1 000 

0 000 

0 000 

0 000 1 

22 
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0 000 
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| tJjtjj-M 


■em 

| 

0 000 


ilBlilil 
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■EM 

0 000 

1 000 

0 000 1 

25 
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0 042 

0312 



0 000 
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4.7. Network Training With Cross Moments 

Training was also attempted for sets of data comprising of the mean, the second, third and 
fourth order central moments of horizontal and vertical displacements along with six cross 
moments upto the fourth order The number of input nodes m this case becomes 14, while 
the output nodes remain equal to 5 Architectures with two hidden layers were trained and 
tested The number of neurons in each hidden layer was kept variable It was found that 
the best results were obtained with 14 neurons in each hidden layer Increasing the number 
of neurons further resulted in overtraining and decreased the success rate The learning 
rate and moment were now varied and the network performance was recorded Table 4 6 
gives this record 
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Table 4.6. Different network architectures and testing success 
for the network with 14 input 


No 

Network 

Architecture 

Moment 

Learning 

Rate 

Target 

Error 

Achieved 

Error 


Testing 

Success 

1 

14,14,14 5 


0 15 

0 01 

0 0334 

2000 

80 00% 

2 

14,14,14,5 


0 15 

0 01 

■eeesb 

2000 

80 00% 

3 

14,14,14,5 


0 10 

0 01 

mmim 

HEEEIlN 

82 85% 

mm 

14,14,14,5 

0 98 


mlim 

if 

mam 

■aaf 



0 90 


MEM 


2000 

EEB 

6 

14,14,14,5 

0 90 

0 40 

0 01 

■OHOI 

2000 

88 57% 

7 

14,14,14,5 

0 85 

0 15 

0 01 

0 1005 

2000 

90 00% 

8 

14,14,14,5 

0 80 

0 10 

0 01 

0 0545 

2000 

91 42% 


It can be seen from Table 5 5 that for a value of moment equal to 0 8 and learning rate 
equal to 0 1, the network converges to a mean square error of 0 05 The error graph (Fig 
4 13) shows the decreasing error with the number of iterations performed The success 
rate m this case is found to be 91% A comparison of the network output and the actual 
output for fault identification in this case is shown in Table 4 7 
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Table 4.7. Comparison of network output with actual output 
for the network with 14 inputs 


No 

Network On 

itput 



Actu 

al Output 
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■ 

— ■ 
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|il«m 

1—1 

gym 

BHI 

0 000 


MSWM 
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mmm 
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4.8. Remarks 

It was observed from the above training schemes that, the network had failed to identify 
the mass unbalance In some cases, mass unbalance was identified as a combination of 
mass unbalance and misalignment In all cases the network successfully identified the 
samples that are fault-free It was also observed that bearing cap loose case mi play in 
coupling were identified very accurately by the network In some cases network had 
minor problems in identifying the combination of mass unbalance and misalignment The 
overall performance of the network can be said to he approximately between 75 to 85% 























































































CONCLUSIONS 


The present study needs to be viewed as a preliminary attempt at making use of the 
Artificial Neural Network technology for fault prediction in rotating machinery During 
the course of this work focus was kept on the most crucial aspect of data generation foi 
healthy and faulty rotor system The well established technique of using statistical 
moments of the system signatures for network training were adopted Faults were 
dehberately introduced m the rotor system and vibration signals were picked up using 
piezo-electnc sensors The vibration signals were then statistically processed to obtain the 
moments to be fed as inputs to the training algorithm The ANN simulator was written in 
ANSI-C, trained, tested and vahdated The results obtained are encouraging and lead to 
optimism on the possible use of ANNs as an effective tool for rotating machinery fault 
diagnosis Tins would, however, require a more ngourous training environment - which 
could vary the training parameters hke learning rate, moment automatically or m an 
interactive manner and also make a comparison between the effectiveness of various sets 
of inputs, to obtain best possible results All this can be taken up as future development oJ 
the present work The basic framework of the algorithm can remain the same Additiona 
modifications and alterations can be made wherever necessaiy The simulator may need tc 
be refined by additional experimental data generation also The possibility of incorporating 
the present algori thm, which is presently written in ANSI-C, as a Virtual Instrument icon 
in a Virtual Instrumentation Software is very attractive This can result in a package which 
combines data acquisition, processing and fault prediction through an ANN 
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